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Abstract — Cuckoo hashing with a stash is a robust multiple 
choice hashing scheme with high memory utilization that can 
be used in many network device applications. Unfortunately, for 
memory loads beyond 0.5, little is known on its performance. 

In this paper, we analyze its average performance over such 
loads. We tackle this problem by recasting the problem as an 
analysis of the expected maximum matching size of a given 
random bipartite graph. We provide exact results for any finite 
system, and also deduce asymptotic results as the memory size 
increases. We further consider other variants of this problem, 
and finally evaluate the performance of our models on Internet 
backbone traces. More generally, our results give a tight lower 
bound on the size of the stash needed for any multiple-choice 
hashing scheme. 



I. Introduction 
A. Background 

Network devices increasingly rely on hash tables to ef- 
ficiently implement their algorithms, in fields as diverse as 
load-balancing, peer-to-peer, state management, monitoring, 
caching, routing, filtering, and security Ql-(6|. 

Because of the stringent memory size constraints in network 
devices, recent research is increasingly dealing with improving 
the memory efficiency of hash tables. In particular, cuckoo 
hashing has recently drawn a lot of attention due to its efficient 
space utilization along with its constant query and deletion 
times, as well as its constant expected insertion time (e.g., JTJ- 
|[T8l and references therein). In cuckoo hashing, we want 
to insert n elements into m unit-sized bins. Each of the n 
elements typically uses d = 2 independent hash functions, 
each pointing to an arbitrary bin. When an element arrives, it is 
placed in one of these 2 bins. If both bins are full, it displaces 
another element, which is then moved to the bin corresponding 
to its other choice. This process continues until all elements 
are placed, unless it is stopped and then the element cannot 
be inserted. 

Cuckoo hashing is especially interesting because of its high 
memory utilization. In fact, consider the bipartite graph formed 
by the n elements on one side, the m bins on the other, 
and 2 links leaving each element for 2 bins according to the 
hash values of the element. Then the number of elements that 
cuckoo hashing inserts successfully is exactly the size of the 
maximum matching ifTTIl . fl9l . i.e. it is extremely sufficient. 
Past papers have in fact shown that up to a load n/m = 0.5, all 
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elements could fit in the hash table with high probability Q, 

Unfortunately even this high efficiency of cuckoo hashing is 
not sufficient for network devices, because designers typically 
consider that at load 0.5, half the memory is lost. This is why 
memory-efficient hashing schemes attempt to pack even more 
elements by introducing an additional memory, called stash or 
overflow list, that stores a small number of elements outside 
the main hash table 0, lfT4l - lfT7ll . The stash can then be 
implemented in hardware using for instance CAMs (content- 
addressable memories), which rely on associative memory and 
consume significantly more power (6), ifTSl . It can also rely on 
another hash table, or simply correspond to dropped packets 
in a lossy hash table [20|. 

This memory-efficient architecture with a stash enables the 
load of the hash-table to increase beyond 0.5. Unfortunately, 
when the load gets beyond 0.5, sizing the stash is not fully un- 
derstood [18, Open Question 5]. This paper is about analyzing 
this challenging case where the load exceeds 0.5. 

To further understand why cuckoo hashing has a high 
utilization, it is important to notice that online cuckoo hashing 
with d = 2, succeeds in inserting an element if and only if 
an augmenting path originating from the corresponding vertex 
exists Hill . This is because inserting an element into a cuckoo 
hash table is equivalent to finding an augmenting path in 
the corresponding graph (that is, a path that starts from the 
vertex corresponding to the considered element, and alternates 
between unmatched and matched edges until it ends at a right- 
side vertex whose all edges are unmatched). Notice that for 
any sub-path (n, v, r<2), where v is a left-side vertex and n , r^ 
are right-side vertices, (ri,v) must be a matched edge and 
(v, r<z) an unmatched edge. Intuitively, this corresponds to 
moving element v from bin r\ to bin r^. Since maximum 
size matching can be computed by finding such augmenting 
paths, when considering each left-side vertex only once and in 
arbitrary order, we can immediately conclude that the number 
of elements that a cuckoo hashing inserts successfully is 
exactly the size of the maximum matching. For example, all 
n elements can be inserted if and only if the corresponding 
graph has a perfect matching (namely, a maximum matching 
of size n; see IfTTIl for more details). 

B. Contributions 

In this paper, we attempt to model the behavior of cuckoo 
hashing with a stash as the load gets beyond 0.5. To do so, 
we essentially transform the problem into a problem in graph 
theory, then provide a theoretical analysis of its performance, 



and later evaluate its real-life behavior by using Internet 
backbone traces. 

First, we study the average performance of cuckoo hashing 
by analyzing the expected maximum matching size in the 
bipartite graph introduced above. We decompose each random 
bipartite graph into connected components, and then separately 
analyze each component and evaluate the size of its local 
maximum bipartite match. The size of the maximum bipartite 
matching is the sum of the sizes of all local matches. Then, we 
count the number of connected components in the graph and 
thus derive the size of the maximum matching in the entire 
graph. Surprisingly, we can obtain an exact expression of the 
average performance of cuckoo hashing with a stash in any 
finite system. 

We further show that the actual maximum matching size 
is sharply concentrated around its expected value. Thus, the 
difference between n and the expected maximum matching 
size provides the required size of the stash, which should 
store all elements with high probability. To do so, we use 
concentration results based on applying Azuma's inequality 
to a Doob martingale, which is defined over the maximum 
matching size when exposing vertices one at a time. In 
practice, the goal of this result is to help designers size 
their CAM stashes by providing guarantees on the number 
of elements that can be inserted in the hash table with these 
stashes. 

We next provide an exact analysis when the average number 
of choices is less than 2 to minimize the number of memory 
accesses. We further obtain a lower bound on the required 
stash size when the number of hashes d exceeds 2. Our results 
for d > 2 rely on Huisimi tree enumerations. They illustrate 
the tradeoff between an improved memory efficiency and the 
need for more memory accesses, i.e. the tradeoff between 
memory size and bandwidth. 

Finally, we evaluate cuckoo hashing with a stash on real- 
life Internet packet traces from an OC192 backbone link, using 
a 64-bit mix hash function. We show that when n — m, we 
can insert an average of 83.81% of the packets within the hash 
table, and put the remainder in the CAM stash. Likewise, when 
n = 0.6m, that is, 20% more than the threshold for perfect 
matching, we can insert in average sa 0.9938n of the packets, 
that is, the need stash size is « 0.0062n. We further confirm 
our analytical models and show that our bounds for d > 2 are 
typically within 1% of the exact value. 

Incidentally, our paper can lead to two interesting contribu- 
tions. First, the paper analysis also provides exact results for 
the stash sizes when the numbers of elements n and buckets m 
are finite. This non-asymptotic analysis is particularly needed 
when n and m are known to be small. For example, as 
suggested in Q, the cuckoo hashing scheme can be used to 
store fingerprints of elements and thus enable set membership 
queries. To be able to move a fingerprint from one bucket to 
another, the hash value can only depend on the current location 
and on the fingerprint itself. This can be modeled by cuckoo 
hashing with a small finite number of elements and buckets, 
implying again that an asymptotic analysis cannot be applied 
in this case. 

In addition, we note that for other multiple-choice hashing 



schemes, our results provide a lower bound on the size 
of the stash. This is because the maximum matching size 
of the graph is always an upper bound on the number of 
elements that can be inserted into the hash table. Moreover, 
since finding the maximum matching in bipartite graphs is 
a fundamental problem with a wide range of applications 
in computer science, we believe that our results have also a 
theoretical significance and may be used in other contexts. 

Paper Organization: We start by surveying the relevant 
literature in Section HI] Then we introduce the preliminary 
definitions in Section [HI] Section [IV] provides the expected 
maximum matching size of random bipartite graphs with left- 
side vertex degree 2, where a variation of the problem in 
which each left-side vertex degree is at most 2 is considered in 
Section [V] Next, in Section IVll we solve the more appealing 
problem in which the right-side vertices are partitioned into 
two subsets, and each left-side vertex has exactly one edge to 
each of these subsets. In Section IVHII we verify and evaluate 
our results, including by real-life trace-based experiments. 
Last, Section IVHI provides an upper bound on the expected 
maximum matching size when the constant left-side vertex 
degree is at least three. For the sake of readability, most of 
the proofs are presented in Appendix lAl 

II. Related Work 

Multiple-choice hashing schemes were first considered in 
the seminal paper of Azar et al. [21]. It showed that placing 
each element in the least occupied bin among a constant 
number d of random bins significantly improves the maximum 
bin load to lo f ' ^" +0(1) with high probability (compared 
to the case where d — 1, in which the maximum bin load is 
logn(l -(- 0(1))). This result initiated an extensive research 
with many variants of multiple-choice hashing schemes, which 
typically exhibited the so-called power of two random choices 
with d = 2 l22l . For brevity, we next survey only works that 
directly correspond to our paper. 

First, we relate to works which considered the same model 
as in this paper (a random bipartite graph with constant left- 
side vertex degree). Motivated by achieving a performance 
guarantee for the cuckoo hashing scheme [23], the main effort 
has been to find a load threshold, such that for any load below 
the threshold a perfect matching exists with high probability. 
It is known that a cuckoo hashing scheme with d = 2 
succeeds with high probability if the load is less than a load 
threshold of 0.5, but fails when the load is larger than 0.5 [7|. 
Recent works [9|-[11| have settled the problem of finding the 
corresponding thresholds for d > 2. Another recent work |[T6ll . 
shows that cuckoo hashing with a stash of size s, d = 2, and 
a load factor less than 0.5 fails with probability (n~ s ). Our 
paper differs in that we also consider load values beyond 0.5 
for d — 2. Moreover, while most of the works investigate 
only the asymptotic behavior, we also present in our paper 
analytical expressions for finite random graphs along with the 
asymptotic ones. 

The problem of finding the expected maximum matching 
size is also investigated assuming other models of random 
graphs, mainly trees. In ll24l (and references therein) the 



authors investigate the expected maximum matching size of 
an (r, s)-tree, finding that for almost all (n, n)-trees the 
percentage of dark vertices in a maximum matching is at 
least 72%. A more recent work 11251 presents results related to 
the expected maximum matching size of the class of simply- 
generated trees. A model of a loop graph is considered by l26l . 
showing a lower bound on the expected maximum matching 
size. While using the cavity method of statistical physics |27|, 
the authors find analytically the value under consideration for 
the Erdos graph G(n,c/(n — 1)), where c < 2.7183. Our 
paper differs in that it considers a different model of random 
bipartite graphs, where each vertex in L chooses a constant 
number of vertices in R. 

Additional related works deal with the probability of a 
perfect matching in other random graph models. For instance, 
in a random directed bipartite graph with n left-side and n 
right-side vertices, and an outward degree d at each vertex, 
the probability that the random bipartite graph contains a 
perfect matching approaches 1 if d > 1, but approaches 
otherwise [28 1 . Also, in a random bipartite graph with n left- 
side vertices, n right-side vertices, en edges picked uniformly 
at random, and a degree of at least 2, there is a perfect 
matching with high probability |29| . 

Finally, conjectures in |30|, [31 1 consider the expected mini- 
mum matching weight given a full bipartite graph with random 
exponentially distributed edge weights. These conjectures are 
proved in 11521. l33l. 

III. Definitions and Problem Statement 

Given two disjoint sets of vertices L and R of size n and 
to respectively, we consider a random bipartite graph G = 
(L + R,E), where each vertex v £ L has d = 2 outgoing 
edges whose destinations are chosen independently at random 
among all vertices in R. We allow both choices to be the same 
vertex, implying that G might have parallel edges. For brevity, 
we sometimes say that v E L chooses a vertex v' £ R if (v, v 1 ) 
is in E. The load of G is denoted by a = — . 

We further consider cases when the averages number of 
choices is less than 2. 

Definition 1: Let d v be the number of choices of each 
vertex v £ L. The average number of choices a is the average 

, ^ •, , ■ E (£„<=f d «) T. cr E(d„) 

left-side vertex degree, i.e. a = — — ^= — - — " gij — -. 

First, in the deterministic case, we find the expected max- 
imum matching size of the graph G a = (L + R,E), where 
each vertex v € L independently chooses a predetermined 
number d v G {1)2} of random vertices in R, such that 

_ d 1 +2-d 2 
n 

Second, in the random case, we analyze the slightly different 
case of a random bipartite graph G p = (L + R, E) where each 
vertex chooses two vertices with probability p and one vertex 
with probability 1 — p. This implies that in G p , the average 
number of choices a = 1 + p. 

Finally, we also consider a static partitioning of the choices; 
the set R is partitioned into two disjoint sets R u and Rd of 
sizes (3 ■ m and (1 — /?) m. In that case, we consider a random 
bipartite graph Gp — (L + (R u U Rd),E), where each vertex 
v E L chooses exactly one vertex in R u and another vertex in 
Rd- 




Fig. 1. An example bipartite graph with left-side vertex degree 2 



This paper focuses on the expected size of the maximum 
size matching of G, which is captured by the following 
definition: 

Definition 2: The operator /i (•) extracts the expected size of 
the maximum size matching. It operates both on deterministic 
and random bipartite graphs. (Namely, for a deterministic 
graph H, /i (H ) is simply the size of the maximum size 
matching of H.) 

Definition 3: The normalized limit expected maximum 
matching size 7 = lim n ->oo ^^ is the limit percentage of 
the expected maximum matching size (out of the number of 
the vertices in L). 

As shown in the literature, in real-world systems, practical 
hash functions usually work as if they were fully random [6]. 
Therefore, we model the hash functions in our theoretical 
analyses as such. 

Our goal is to find both the expected maximum matching size 
as well as the normalized limit expected maximum matching 
size for the above-mentioned graph models. 



IV. Expected Cuckoo Performance: Bipartite 
Graphs with d = 2 

We are now interested in evaluating the expected perfor- 
mance of cuckoo hashing with a stash. As explained in the 
Introduction, we approach the problem using a graph-theory 
perspective, since it is the same as evaluating the expected 
maximum matching size of the random bipartite graph G. 

To do so, we consider the connected components of the 
random bipartite graph G. We start by stating some lemmas 
on these connected components, before establishing our main 
result on the expected matching size. 

A. Expected Maximum Matching Size 

We now deal with a random graph, in which each left-side 
vertex chooses d = 2 right-side vertices (parallel edges are 
allowed). Note that further evaluation of the results reported 
here appears in Section I VIII I 

First, we quote a few useful lemmas (proved in [19]), before 
stating our result. As stated, the following lemmas are for 
a given bipartite graph H = (Ljj + Rh,Eh), where each 
vertex in Lh has degree 2 (parallel edges are allowed), with 
\Lh\ = s and \Rh\ = Q- An example bipartite graph s = 3, 
q = 4, and left-side vertex degree 2, appears in Figure Q] 
Dashed lines represent edges not in the maximum size match- 
ing, while solid lines represent edges in the maximum size 
matching. 

Lemma 1: If s < q — 2, then H is not connected. 



Lemma 2: If H is connected and s > q, then /i (H) = q. 

Lemma 3: If H is connected and s = q— 1 then /i (if) = s. 

Lemma 4: For any graph with s = # — 1, H is connected 
if and only if it is a tree. 

Lemma 5: The number T s of connected bipartite graphs H 
whose | £# | = 3 and \R H \ = s + 1 is T s = (s + l) s_1 s! 

We can now prove the next theorem on our bipartite graph 
G, which is the main result of this paper. We remind that this 
theorem states the expected number of elements p (G) that 
can be inserted by our cuckoo hashing scheme with a stash. 
Therefore, n — p (G) also gives us the expected stash size. 

Theorem 1: Let d = 2 and b = rnin{n, m— 1}. The 
expected maximum matching size p (G) is 
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Proof: Let M be a maximum matching of G Our proof 
is based on counting the expected number of vertices in R 
that are not part of M, and on the decomposition of G into 
its connected components. 

Lemma [T] yields that any connected component of G with s 
left-side vertices has at most s+1 right-side vertices. We call a 
connected component with s left-side vertices and s + 1 right- 
side vertices a deficit component of size s. Lemma [3] implies 
that the maximum matching size of any such deficit component 
is s. Therefore, exactly one of its right-side vertices is not part 
of M. Notice that in all other connected components, where 
q < s + 1, the maximum matching size of G is exactly q 
(Lemma |2j, implying that all their right-side vertices are part 
ofM. 

Thus, in order to calculate the size of M, it suffices to count 
the number of deficit components x. The size of M is m — x 
because exactly x right-side vertices do not participate in M, 
one for each deficit component. 

Let P s — , 2 ^T^is be the probability that a bipartite graph 
H = (Ljj + Rh, Eh) is connected, with degree 2 for all 
vertices in Lh, where \Ljj\ = s and \Rh\ =3 + 1. 

The expected number of deficit components of size s is 

©CTxMl- ^) 2( ' l ^(^) 2S ^-The above expression 

consists of the following factors (in order): 

(i) choosing the s vertices in L; 

(ii) choosing the s + 1 vertices in R; 

(Hi) the probability that all s + 1 vertices in R may be 

connected only to the chosen s vertices in L; 

(iv) the probability that all s vertices in L are only connected 

to the s + 1 vertices in the right side; and, 

(v) the probability that all chosen vertices are connected. 

Finally, we calculate x by summing over all possi- 
ble values on s. As mentioned before, the expected size 
of M is given by m — x. We get: p (G) = m — 
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min {n, m — 1}, P s — 
as found in Lemma [5] 
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andT s = (s + l) s_ -(s+ 1)! 



B. Concentration Result 

We next show that the size of the maximum matching is 
highly concentrated around its expectation p(G). In other 
words, this means that our stash occupancy will be close to its 
average value, which can help us size the stash more accurately 
by providing performance guarantees on its performance. 

In order to prove this result, we apply Azuma's inequality 
to a Doob martingale (more specifically, the martingale is a 
vertex exposure martingale of the left-side vertices). 

Note that as long as all left-side vertices pick their edges 
independently, this concentration result holds regardless of 
the value of d, and more generally regardless of the specific 
distribution over which the hash functions are defined. There- 
fore, the concentration result applies also for the settings of 
Sections El -EH 

Theorem 2: Let H be a specific instance of the ran- 
dom graph G, as defined in Section [III] For any A > 0, 
Pr(\p(H) - p(G)\ > A^AT) < 2e~ A2 / 2 . 

Proof: Our notations follow those of [ 34] . We first define 
an exposure martingale, which exposes one left-side vertex at 
a time, along with all its outgoing edges. This martingale is 
equivalent to a regular vertex exposure martingale, in which all 
right-side vertices are exposed first, and then left-side vertices 
are exposed one by one. 

Specifically, let G be the probability space of all two-choice 
bipartite graphs as defined in Section [HI] and / the size of the 
maximum size matching of a specific instance. Assume an 
arbitrary order of the left-side vertices L = {vi, . . .v n }, and 
define X ,...,X„ by X % {H) = E[f(G) | Vx < i^Vy e 
R, (v x ,v v ) e G iff (v x ,v y ) e H]. Note that X (H) = p{G) 
since no edges were exposed, while X n (H) = p(H) as all 
edges are exposed. 

Clearly, / satisfies the vertex Lipschitz condition since if 
two graphs H and H' differ at only one left-side vertex, 
\f(H) — f(H')\ < 1 (either that vertex is in the maximum 
matching or not). Thus, since each left-side vertex makes in- 
dependent choices, [34, Theorem 7.2.3] implies that the corre- 
sponding vertex exposure martingale satisfies |Xj_|_i— Xi\ < 1. 
Hence, by applying Azuma's inequality, we immediately get 
the concentration result. ■ 

Notice that if we are interested only in one-sided bounds, we 
can get a slightly tighter result: Pr(/i(G) — p(H ) > Xy/n) < 
e ' . This is exploited in the following corollary, which 
shows that to obtain a given overflow fraction, the needed 
stash size grows sub-linearly with n beyond its average value. 

Corollary 3: To achieve an overflow fraction of e in cuckoo 
hashing with stash, when inserting n elements to m bins, a 
stash of size n — p (G) + \/2n ■ In (1/e) suffices, where p(G) 

is defined in Theorem [TJ 

Proof: If a stash of size n— p (G) + \Jln ■ In 1/e is used, 
cuckoo hashing fails if and only if n — /u(-ff) > n — [i (G) + 
y/2n ■ In 1/e, or by rewriting it, p(G) — p(H) > \/2n ■ In 1/e. 
By substituting A = ^/2 • In 1/e in the above one-sided bound, 
we get the claimed result. ■ 

C. Limit Normalized Expected Maximum Matching Size 

We are now interested in the asymptotic expression where 
b4«d with a = — constant. The following results show an 



interesting connection between the limit normalized expected 
maximum matching size and the Lambert-Vl 7 function, and 
even a connection between the perfect matching threshold and 
the radius of convergence of the Lambert- W-' function |fl9ll . 

E2. 

For further details on the Lambert-W^ function, see also 
Appendix [B] 

Theorem 4: Let d = 2. The limit normalized expected 
maximum matching size 7 = linin^oo ^-^- is given by: 
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where the Lambert- W function is the inverse function of the 
function cj(x) — xe x . 

Proof: We compute the limit of t-^-L as n — > 00 such 
that a = -2-: 



7 = lim — ( m — y^ ( I ( I x 

n->oo n \ ■'— ' \sj \s + \j 
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g+l\ ' I S + 1 



P, 
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We find through differentiation that (l — ^-) is an 

increasing function with respect to n (where m = —), 
Moreover, the expansion of i • (") (™) • ( S ^ L ) S shows that it 
is also an increasing function. Therefore, their product is also 
increasing and, by the monotone convergence theorem 
we get 

7=--F limfi.^V m ) x 
n ~i n-s-oD\n \sj\s + lj 

s — v \ / \ / 



. , v. 2(n-s) / . , \ 2s 
S + 1 \ / S + 1 
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By substituting the expression for P s , and using the facts that 

(?) = IT + ° ( nS_1 ) and lim «^°c (1 + «/")" = e a , we 
deduce: 



m 1 ■t-^ n m stl 
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By substituting m = — , and simplifying the above expression, 
we get: 

\S-1 



7 



i "I V^ . _. (s + 1) p -2a( s +l) 
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a;-' be a formal power series, 



where by substituting x = —2a-e 2o we get the above 
expression. By differentiating T (x) and multiplying by x, we 
get: 

« • 4-T(x) = - 2 ^7, x 1 = -W (a:) , 



dx 



j'=l 



J 



where the Lambert-VK function is the inverse function of the 
function uj(x) — xe x J35l , and the last equality follows from 



its known Taylor expansion that converges as long as x is 
within the radius of convergence with |a;| < e" 1 ||35l . 
Given that x ■ 4^T (x) — ~W (x), we compute T (x): 



T(x) 
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dx " 
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(-W (x)) dx = -W (x) - -W 2 (x) , 



with convergence within |x| < e _1 . 

Interestingly, the function / (a) = —2a ■ e~ 2a gets its 
minimum at a = 0.5, where it precisely equals the radius 
of convergence — er 1 . Therefore, for all a we can substitute 
x = — 2a-e~ 2a , since we are within the radius of convergence 
of T (x), and we finally derive the result. ■ 

We note that this particular asymptotic result can be also 
achieved by the theory of giant components in random 
graphs ll34l . Il37ll . However, this technique is not applicable 
for finite n and m, and cannot be used to derive most of the 
other results in this paper. (A proof using this technique 
appears in the appendices). 

The following corollary shows that for a = — < |, the 
probability for a right-side vertex to be part of a maximum 
matching goes to 1. This corollary also follows from the 
previously known result that there is a perfect matching with 
high probability in cuckoo hash tables with load a < \ [7j. 

Corollary 5: Let d — 2 and a — — < i. Then the 
limit normalized expected maximum matching size is 7 = 



linir. 



KG) 



1. 



Proof: In case a < \,W {—2a ■ e 2q ) equals —2a, thus, 

7^ + 2^-(-2a) + 3^(-2") 2 = l " ■ 

V. Cuckoo with Low Memory Bandwidth: Bipartite 
Graphs With d v < 2 

In this section we are interested in a low-memory-bandwidth 
version of the cuckoo hash algorithm. We now let each element 
choose either 1 or 2 bins instead of only 2 bins, to force them 
to access less bins and use less memory I/O bandwidth. 

Formally, we relax the constraint that each vertex in L 
chooses exactly 2 vertices in R, and let each left-side vertex 
choose either 1 or 2 right-side vertices. Since we can divide 
the set of vertices either deterministically or randomly, we will 
discuss the results in both cases. These results correspond for 
example to cases in which the average number of choices, as 
defined below, is important (e.g. llT5l ). See also |[38l for a 
similar model. 

Note that further evaluation of the results reported in this 
section can be found in Section IVIII-BI 

A. Connected Components in Deterministic Graphs 

As in Section IIV-AI we now consider a deterministic 

bipartite graph H = (Ljj + Rh,Eh), with \Ljj\ = S and 

\Rh\ = <Z- We assume that the degree of each vertex in Lh 

is at most 2. 

Proposition 1: Lemmas [T] [2] and [3] hold also when the 

degree of each vertex in Lh is at most (but not necessarily) 

2. 

Note that the proofs remain almost identical to the original 

proofs, replacing a few equalities with the corresponding 

inequalities. 



Lemma 6: Let s + 1 = q. If H is connected then the degree 
of each vertex in Lh is 2. 

B. Expected Maximum Matching Size 

1) Predetermined Number of Choices: In this section, we 
assume that each vertex v £ L independently chooses 1 < 
d v < 2 random vertices in R, where d v is predetermined. The 
following result provides the expected maximum matching 
size in this case. 

Theorem 6: Given a predetermined average number of 
choices a, let d\ = (2 — a)-n and d 2 = n — d\ = (a — l)-n be 
the number of vertices in L that choose one and two vertices in 
R, respectively. The expected maximum matching size p, (G a ) 
is given by: 



"<°->— §(•)(•- 



2(d 2 -3)+d 1 



S+l 



(« + l) 



where b = min {^2, to — 1}. 

2) Random Number of Choices: In this section, we assume 
that each vertex v € L independently chooses 1 < d v < 2 
random vertices in R, where for each v £ L, d v equals 2 
with probability p, and it equals 1 with probability 1 — p. 
Based on Theorem[6] the following result reflects the expected 
maximum matching size in this case. 

Theorem 7: The expected maximum matching size p (G p ) 
is given by 



M(G P ) = E; =0 (; 2 )-p d2 -(i-^) 

p (G a ) is given by Theorem [6] 



n — d 2 



■ /'• 



( G a= i+4a) 



where 



C. Limit Normalized Expected Maximum Matching Size 

1) Predetermined Number of Choices: We are also inter- 
ested in the asymptotic expression, where n — > oo, such that 
we fix both the load a = — and the average number of choices 
a = 1+ 2 of the vertices. This is reflected in the following 
theorem. 

Theorem 8: The limit normalized expected maximum 

matching size j a — lim„_ i . 00 ^' ' with average number 

™ (O \ 

of choices a £ (1,2] is given by: 7 a = lim - " — 

n— ^oo 

1 W(-2a(a-l)-e~ aa ) W 2 (-2a (a - 1) • e 



n 

-aa \ 



4a 2 • (a - 1) 

n(G a ) _ 1 _ 1 _ Q 



For 



a 2a 2 -(a-l) 

a = 1, it is given by 7 a = lim„_ 

Interestingly, if even a small fraction of the elements do not 
have choice then the expected maximum matching size is not 
1. This is reflected in the following corollary. 

Corollary 9 ((No) Perfect Matching): If 1 < a < 2 then 

la < 1. 

2) Random Number of Choices: We now study the case 
of the random bipartite graph G p = (L + R, E), where each 
vertex chooses two vertices with probability p (and one vertex 
with probability 1 — p). As we show in the next theorem, the 
asymptotic expression can be derived by j a . 

Theorem 10: The limit expected maximum matching size 
7 p = lirrin^oo p ' where each vertex chooses two vertices 
with probability p (and one vertex with probability 1 — p) is 

7p = la=l+p- 



VI. Single-Ported Cuckoo: Static Partitioning of 
the Choices 

We now consider a popular cuckoo-hashing implementation 
variant in which the bins are statically partitioned into two 
equal sets, and each element holds one hash function to each 
set. This variant is easier to implement in hardware, because it 
can be implemented using two simple single-ported memories, 
instead of a single dual-ported one. 

Formally, we consider the random bipartite graph Gp = 
(L + (R u U Rd),E), where R is now partitioned into two 
disjoint subsets R u and Rd with \R U \ = j3 ■ m and \Rd\ = 
(1 — f3) m. Each vertex v £ L independently chooses a single 
random vertex in R u and another single random vertex in 
Rd- This corresponds, for example, to a hashing scheme that 
selects non-overlapping sets of bins as images of its hash 
functions (e.g., as in multilevel hashing scheme ||39l or d- 
left 11401). 

Note that further evaluation of the results reported in this 
section can be found in Section IVIII-CI 

A. Connected Components in Deterministic Graphs 

The following lemma counts all the possible bipartite graphs 
H u d of the form (Lh + (Rh u U RH d ),EH) with degree 2 for 
each vertex in Lh, where \Lh\ = s, \Rh u \ = i and \Rn d \ = 
j, such that each vertex v £ Lh is connected using a single 
edge to some vertex in Rh u and another single edge to some 
vertex in Rn d - 

Proposition 2: Lemmas Q] [2] [3j and [4] hold for this case as 
well. 

Lemma 7: Let s = i + j—1. The number T^.j of connected 



bipartite graphs is T,j 
(i + j-l)\ 



= P- 1 ■ f- 1 ■ s! 



iJ-l . ji-l . 



B. Expected Maximum Matching Size 

In the next theorem we find the expected maximum match- 
ing size with a static partition of the right-side vertices. 

Theorem 11: Given the static partitioning of the bipartite 
graph Gp, the expected maximum matching size /i (Gp) is 



»(Gp) 



-EC)Err)Cr +1 j :r) 






(l-/3)-m 

*' i,s-{-l — i i 



/3 • m 



/3 ■ m 

s + l — i 
(1 - P) -m 



where b\ = max {0, S + 1 — (1 — /?) ■ m}, 

b 2 = mm{s + l,f3 -m,}, P tj = ^+]-i , and 

Tij = f 7 ' -1 • j 1 ^ 1 ■ (i + j — 1)! (as given in Lemma |7|. 

Proof: Similarly to the proof of Theorem Q] our proof is 
based on counting the expected number of vertices in L that 
are not in some specific maximum matching M of Gp, based 
on the decomposition of G into its connected components. 
As in the proof of Theorem Q] we consider the number of 
connected components with exactly s vertices in L and q = 
s + l vertices in R u U Rd, where we have to sum over all 
possible combinations (i, s + 1 — i), where i corresponds to 
the number of vertices taken from R u and s+l — i corresponds 
to those taken from Rd- 



Thus, the expected number of connected components in Gp 
with s vertices in L, i vertices in R u and a + 1 — i vertices 
in Rd is given by: 



n \ iP-m\ (l-P)-m 
• / \ i I \ s + 1 — i 

n — s 



1- 



(l-j0)-m 



/3-m 





\ 7 

1 ] 


I — S 


5 


m J 




£ 


+ 1- 


- 7 



(1 - p) ■ m 



The above expression consists of the following factors (in 
order): 

(i) choosing the s vertices in L; 
(ii) choosing the i vertices in R u ; 
(Hi) choosing the s + 1 — i vertices in Rd', 
(iv) the probability that all i vertices in R u may be connected 
only to the chosen s vertices in L; 

(v) the probability that all s + 1 — i vertices in Rd may be 
connected only to the chosen s vertices in L; 
(vi) the probability that all s vertices in L are only connected 
to the i vertices in R u ; 

(vii) the probability that all s vertices in L are only connected 
to the s + 1 — i vertices in Rd', and, 
(viii) the probability that all chosen vertices are connected. 

Finally, adding the expressions for all possible s's and i's 
and subtracting it from m yields the claimed result. ■ 



C. Limit Normalized Expected Maximum Matching Size 

As in the last sections, we are also interested in the 
asymptotic expression where n — > oo with both fixed a = — 
and fixed f3. This is achieved in the following theorem. 

Theorem 12: Given the static partitioning of the bipartite 
graph Gr, the limit normalized expected maximum matching 

size 7^ = lim^oo ^^ for /3 € (0, 1) is given by: 7^ = 



p-a-P) 



(h + h —t\ -t%), where t\, *2 are provided by 
the following equations 



1-/3 



tx 



a — s_ 
_. e l .„ 



t 2 



(2) 



and satisfy the condition t\ ■ ti < 1. 

For j3 £ {0, 1}, (namely, the trivial partitions), the limit 
normalized expected maximum matching size 7^ is — — — ■e~ a . 

We deduce the following two corollaries. 

Corollary 13 (Asymptotic Equivalence): Let d = 2. The 
limit normalized expected maximum matching size of Gp with 
j3 = 0.5 is the same as the limit expected maximum matching 
size of G. 

Proof: We substitute j3 = 0.5 in the expression from 
Theorem [12] and get rp 



0.5 



h 



0.5 



1 2 • e x . One of the solutions of the above equations is t\ = 
ti = —W (— 2ae~ 2a ). In the proof of Theorem|U we showed 
that -W (-2ae~ 2a ) < 1. Thus, t\ • t 2 < 1. By substituting 
this solution in the expression for 7^ from Theorem Q~2] , we 
get the exact expression as in Equation dTJ. ■ 

Corollary 14: Let d — 2, a < i, and fix a partition j3. 
The limit normalized expected maximum matching size ja — 
linw+oo ^^ is 1 whenever ^~^\-^ 2 < /3 < i+Vi-4a^ _ 
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Fig. 2. Expected maximum matching size for various values of n and rn 
(normalized by n) 



VII. Super-Cuckoo: Bipartite Graphs with d > 2 

We are now interested in checking how powerful cuckoo 
hashing can be when we allow more than 2 hash functions 
per element. Of course, using more hash functions will result 
in an increase in implementation complexity, and therefore 
one goal of this study is to point out the tradeoff between 
efficiency and complexity. 

In this section we briefly show how our method can be 
applied to find an upper bound on the expected maximum 
size matching where each left-side vertex has d > 2 choices. 
Formally, we are given two disjoint sets of vertices L and R of 
size n and m, respectively, and a random bipartite graph G d = 
(L + R,E), where each vertex v £ L has d outgoing edges 
whose destinations are chosen independently at random (with 
repetition) among all vertices in R. We obtain the following 
upper bound on the maximum matching size of the bipartite 
graph G d . 

Theorem 15: Let b — min < n, ^jErt \ an d 1 = (d — 1) • 
a + 1. Then, /1 (G d ) is lower or equal to 



n\ fm\ f q \d(n-s) / q \ds d a ■ q\ 

~ mi vj g (d-l). s +2 



inin,m-^T i (q-s) ( J ( 



s'M 



An evaluation of the upper bound and a comparison to 
the simulated expected matching size is presented in Sec- 
tion IVIII-DI 

VIII. Evaluation and Experiments 

A. Expected Maximum Matching Size With d = 2 

Figure [2] shows the expected maximum matching size 
normalized by n for various values of n and m. We show 
the expected maximum matching size both via our analytical 
model from Theorem[T]and via simulations. For each instance 
of n and m, we randomized m — 10,000 bipartite graphs. The 
results fairly confirm that our model is accurate, and also show 
the convergence of the expected maximum matching size to its 
limit. A simple evaluation appears in the following example. 

Example 1: In case n = m = 2 (and d — 2), the expected 
maximum matching size is p, (G) = ¥ = 1-875. This simple 
result can be justified as follows: In all cases the maximum 
matching size is 2, except for the two cases of maximum 
matching of size 1, where all 4 edges are connected to a 
specific vertex in R. Each such case occurs with probability 



'1\4 



-i> • Hence,/x(G)=2-^ 6 h; s 



15 



Figure [3] shows the expected maximum matching size nor- 
malized by n as found in Theorem |4] for various values of 
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Fig. 3. Limit expected maximum matching size for various values of load 
ct, normalized by n 
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Fig. 4. Limit expected maximum matching size for various values of a and 
a, normalized by n 



load a, both via our analytical model and via simulations. The 
simulations were performed using m = 1000 and n = a ■ m. 
For each value of a, we randomized 100 bipartite graphs. The 
results fairly confirm that our model is accurate. 
We conclude by the following simple example: 
Example 2: In case a = 1, that is n = m, the normalized 
limit expected maximum matching size is 



7 = 1 + - • W (-2 • e" 



-W 2 (-2 • e- 2 ) sw 0.8381. 
4 v y 



B. Expected Maximum Matching Size With d v < 2 

Figure [4] shows the normalized limit expected maximum 
matching size, for various values of load a and average 
number of choices a, both via our analytical model (from 
Theorem 2|l as well as via simulations. The simulations were 
performed using m = 1000 and n = a ■ m, where for each 
instance of the simulation we randomized 100 bipartite graphs. 
The results fairly confirm that our model is accurate. 

C. Expected Maximum Matching Size With Static Partition 

Figure [5] shows the limit expected maximum matching size 
normalized by n, for various values of load a and partition 
j3, both via our analytical model (from Theorem [T2l and via 
simulations. The simulations were performed using m = 1000 
and n = a ■ m. For each pair of values of a and j3, we 
randomized 100 bipartite graphs. The results fairly confirm 
that our model is accurate. 

As expected, the limit expected maximum matching size is 
symmetric around j3 = 0.5. In case a = 0.5 and j3 < 0.5, 
while it seems that the normalized limit expected maximum 
matching size is 1, it is not the case. For instance, in case 
a = 0.5 and /3 = 0.45, we get that 1 - jp « 1.675 ■ 10~ 7 . 
However, there are cases where imbalance in the partition sizes 
does not reduce 73, as shown for instance in Corollary [T4l 
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Fig. 5. Limit expected maximum matching size for various values of /3 and 
a, normalized by n 




Fig. 6. Upper bound on the normalized expected maximum matching size 
for a = 1 as a function of d 



D. Expected Maximum Matching Size With d > 2 

We evaluate the upper bound found for the expected match- 
ing size (Theorem [15). Figure [6] shows our upper bound as 
well as simulation results for various values of the number of 
choices d. We took n = m = 100, while for each instance of d, 
we randomized 10 5 bipartite graphs. In the case of d = 2, our 
upper bound matches the exact expression found in TheoremQ] 
and thus matches the simulation results. In addition, we can 
compare simulation results for higher values of d with our 
bounds. For instance, in the case of d = 3 the normalized 
expected maximum matching size via the simulation is 0.9402, 
while our upper bound is 0.9508. In case d — 4, we get 
a simulation value of 0.9795, while the corresponding upper 
bound is 0.9820. 

E. Trace-Driven Experiments 

We have also conducted experiments using real-life traces 
recorded on a single direction of an OC192 backbone link Pffl . 
where packets are hashed using a real 64-bit mix function [42). 
Our goal is two-folded. First, we would like to verify that our 
analysis agrees with results of real-life traces. And second, 
we want to verify that the distribution of overflow list size is 
highly concentrated around its mean, as stated in Theorem [2] 

We took m = 10,000, and set a number of elements n as 
corresponding to various values of load a. We repeated each 
experiment 100 times. Fig. [7] shows that the results of our 
experiments are very close to our model. Furthermore, it also 
shows the minimum and the maximum overflow list size as a 
function of the load a, thus, introducing a confidence interval 
of 98% for the case where m — 10,000. Note that, as reflected 
in Thoerem [2j if we increase m (and set n accordingly) then 
the confidence interval narrows down. 

IX. Conclusion 

In this work, we analyzed the performance of cuckoo 
hashing with a stash for loads above 0.5. We first provided 
an exact expression for the expected maximum matching size 
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Fig. 7. An experiment using real-life traces 



of a random bipartite graph with each left-side vertex picking 
d = 2 right-side vertices. Then, we deduced asymptotic results 
as the memory size goes to infinity, and showed a connection 
to the Lambert-iy function. 

Both these results directly apply as exact results for the 
average number of inserted elements using cuckoo hashing. 
They also help us size the stash needed in the algorithm. 
In addition, they serve as an upper bound for any alternative 
hashing algorithm. 

We also discussed alternative cases, in which cuckoo either 
uses a lower memory bandwidth to gain power, or uses a 
higher memory bandwidth to gain in efficiency, as well as a 
case in which memory is partitioned and can be implemented 
using two single-ported memories. Finally, we evaluated our 
results on Internet backbone traces. 

As future work, our goal is to implement the algorithm 
in FPGAs, and evaluate its performance according to the 
measures accepted in the switch industry (e.g., mean time to 
failure vs. power utilization resulting from the CAM stash). 
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Appendix A 
Omitted proofs 

A. Proof of Lemma Q] 

The proof follows by induction on s. For s = 1, there are 
2 edges in the graph and therefore every graph with q > 3 is 
not connected. Assume that the claim holds up until s = s' , 
we next prove that it holds for any bipartite graph H 1 such 
that \Lh'\ = s' + 1 and \Rh'\ > s' + 3. Assume towards 
a contradiction that there is a graph H' that is connected. 
We first show that there is a vertex in Rjji with a degree 1: 
This follows from the fact that the average right-side degree 



2(V + 1) 
s'+3 



< 2, implying that there is at least one vertex with 
degree strictly less than 2; since the graph is connected, there 
are no right-side vertices with degree 0. Let v r be such a vertex 
and let V( £ Ljj> be the (only) left-side vertex to which it is 
connected. By the induction hypothesis, the graph induced by 
Lh' \ {vt} and Rh> \ {v r } is not connected, implying it has 
at least two connected components. In H 1 , vg is connected to 
v r and since its degree is 2 it can be connected only to one of 
these components. This implies that H' is also not connected, 
and the claim follows. ■ 



B. Proof of Lemma \2\ 

We first consider the case where s = q. For S C Lh, 
let d(S) C Rjj be the set of vertices that are adjacent to 
any vertex in S. Hall's Theorem [43] implies that to prove 
that /i (H) = q (namely, there is a perfect matching in H) 
it suffices to prove that for every S C Lh, \S\ < \d(S)\. 
Assume towards a contradiction that there is a subset S C Lh 
such that \S\ > \d(S)\, and denote \d(S)\ by b. Furthermore, 
consider the bipartite graph H = I Lh + Rh,Eh), in which 

L H = L H \ S, R H = %U {v R } \ d(S) (where v R is a 
newly-introduced vertex) and any edge in E(H) of the form 
(ve, v r ) such that vg £ Lh \ S and v r £ d(S) is replaced with 
the edge (ve, vr) in Eh- Notice that since H is connected, H 



must be connected as well. Recall that |5| > b, thus 



L 



H 



= \R H U{v R }\d(S)\ = 
This contradicts Lemma Q] 



\Lh\S\ < s-b-1, while R H 

\Rh\- \d(S)\ + l = s-b+l. 

implying that for every S C Lh, \S\ < \d(S)\ and by Hall's 

Theorem ft (H) = q. 

For s > q, trivially 11(H) < q. Therefore, it suffices to 
show that there exists a subset S C Ljj of size q, such 
that the corresponding bipartite subgraph is connected (and 
hence has a perfect matching of size q). We construct S in q 
iterations such that at the end of iteration n we end up with 
some subsets S n C Lh and Q n C R H of the same size 
n, whose corresponding subgraph is connected. We start by 
n = 1 and pick some vertex vr £ Rh and one of its adjacent 
vertices vl £ Lh- Assuming that at the end of iteration n, 
sets S n and Q n were chosen (and their corresponding graph 



is connected), we next construct S n +i and Q n +i- Let v\ be 
an arbitrary vertex in S n and let V2 be an arbitrary vertex 
in Lh U S n (such a vertex always exists since s > q > n). 
Similarly, let v[ be an arbitrary vertex in Q n and let v' 2 be an 
arbitrary vertex in Rh U Q n . Since H is connected there is a 
path between v\ and v-i, and let v be the first vertex along this 
path that is not in S n . Similarly, v' is the first vertex along 
the path between v[ and v' 2 that is not in Q n . We differentiate 
between three cases: (i) v is adjacent to Q n and v' is to S n . 
In this case S n +i = S n U {v} and Q n +i = Qn U {v 1 } and the 
corresponding subgraph is connected; (ii) v is not adjacent to 
a Q n . Let w be the vertex before v in the path between v\ and 
V2, and let w 1 be the vertex before w in the path. Note that 
w' £ S„ by the choice of v, and that w $■ Q n (otherwise 
v is adjacent to a Q n ). Thus, for S n +i — S n U {v} and 
Qn+i = Qn U {w}, the corresponding subgraph is connected; 
(Hi) v 1 is not adjacent to a S n . The claim holds similarly to case 
(ii) by looking at the path between v[ and v' 2 - We continue this 
construction for q iterations, resulting in two subsets S q C Lh 
and Q q C Rh of size q each, whose corresponding subgraph 
is connected. ■ 

C, Proof of Lemma \3\ 

Since each vertex in Lh has a degree of two, the sum of the 
degrees of all the vertices in Rh is 2s = 2q — 2. Therefore, 
there must be at least one vertex v r £ Rh with degree 1 
(there cannot be a vertex with degree since H is connected). 
Let Vl £ Lh be the (only) vertex that is connected to vr 
and vr £ Rh be the other vertex that is connected to vl- 
Also consider the bipartite graph H = (Lh + Rh, Eh) that 
is given by removing vr from H and adding a new edge 
{VL-, Vr)- By the construction of H, the degree of each vertex 
in Lh is exactly 2. Moreover, since H is connected, H is also 
connected. Hence, Lemma|2]implies that there is a matching of 
size s in H. By the construction of H, this is also a matching 
in graph H . ■ 

D. Proof of Lemma @ 

First, if H is a tree then it is connected by definition. To 
show the other direction, we assume towards a contradiction 
that H is a connected graph with cycles; let C be a cycle 
in H, and consider an edge e = (vl,vr) that resides at 
cycle C (where vl £ Lh and vr £ Rh)- We build the 
bipartite graph H = ILh + Rh,Eh), such that L H = L H , 

Rh = Rh U {vr}, where vr is a newly-introduced vertex, 
and Eh = Eh \ {e} U {e}, where e = (vl,vr). Intuitively, 
we replace one of the edges in the cycle to reach for a newly- 
introduced vertex, and by that we increase the size of the 
connected component. Notice that H is connected and all 



L 



H 



< 



vertices in Lh have a degree of 2. But, 

thus contradicting Lemma [T] and the claim follows. 



Rh 



1. 



E. Proof of Lemma \5\ 

We count the connected bipartite graphs with two disjoint 
sets Lh and Rh- By Lemma [4] we have to count the number 
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of trees over the set Lh U Rh, where edges must be of the 
form (vl,vr), such that vl E Lh and vr e Rh- We build 
(and count) the set as follows: The number of trees over the 
set Rh is (s + l) 1 . For each such tree instance, we put a 
new vertex (originally from Lh) between each pair of adjacent 
vertices. There are s! possibilities to do so. ■ 



F. An Alternative Proof of Theorem 

Considering the random graph with m vertices and n edges 
such that a vertex mi is connected to vertex m 2 if and only 
if there exists an element that hashes into rti\ and m%. This 
random graph is called the cuckoo graph |9|. Neglecting the 
0(1) loops, this graph is equivalent to the Erdos-Renyi random 
graph G m . n that assigns equal probability to all graphs with 
exactly n edges (and m vertices) 

A matching in G m ,n corresponds to directing some of the 
edges in the random graph such that the in-degree is at most 
1. For each connected component C in G m>n , if C is a tree 
we can direct all edges, while in all other cases we can direct 
as much edges as the number of vertices. 

The number of such edges and vertices can be found in [ 
1371 , yielding the exact same result. 



G. Proof of Lemma [7] 

The proof is identical to the proof of Lemma [4] with two 
modifications. First, instead of initially counting the number of 
trees over the set Rh, we count the number of parity trees [44| 
over the disjoint sets Rh u and Rn d ■ By [44] we are given that 
the number of parity trees is V^ 1 ■ j 1 ^ 1 . Second, we do not 
have to color the edges because of the partition. ■ 



H. Proof of Theorem [72] 

As in the proof of Theorem |4] we compute the limit of 



KG) 



as n — > oo. We consider the case where a = — and 



< /3 < 1 are fixed. So -fp = lim„ 



*£&, that is, 



7/3 



n-joo n I Z_/ \ S J 2-s V i )\ s + l—i ) 

\ s — i=bi 



/3-m, 
s + l — i V 
(l-/3)-m 



s+l-i \ n 



(l-/9)-m 



/3 • m 



■Pi, 



By substituting the expression for Pi , s +i—t from Theorem fTTl 
and moving (™) inside the second summation, we get: 

/ 1 i ™ i±i ,m //3m\ /(l - 0) m\ ( i N "~ s 

7/3 = lim x v i w \i 

n— ^oo \ a n 



EEO(T)(';;riT) ' 



= i=0 

s+l-i \ n 



/3rn 
s + 1 - i 



(l-P)-mJ \li-mj \{\-fi)-m 

(s+i-i)-i . ( a + i _ j)'- 1 . (i + (^ + i _ j) _ i)i 

(i-(s + l-i)) i+( - 3+1 - i '>- 1 



By substituting a = — , we get: 



7/3 



/ n . n s + 1 B 1-/3 / 

lim I-irvn« n )( — B . i-f 



1 - 



1-/3 



1-/3 



j(»+i-i)-i . ( s + l - i)*" 1 . (j + ( fl + l -i) - i)| 
^-(s + l-i))*- 1 -^- 1 - 1 -^- 1 

As in the proof of Theorems |4] and [8] using the monotone 
convergence theorem ll36l . we can put the limit inside the 
sum. By further simplifying the above expression with similar 
consideration to the proofs of Theorems [4] and [HJ we get 
eventually: 

, .+1 .( s+ l-i)-l ,( g + 1 _ i y-l 



= ~ ~2 1^1^ 

s=0 i=0 

a 



(\ 



(\- 



i\- (s+l-i)! 



P 



■P 



We switch the order of summation and get that i € {0,1,...} 
and s goes from max{0, i — 1} to oo. We also substitute j = 
s + l—i(oTs=i+j — 1). Thus, 



7/3 



1 

a 



/?•(! — /?) 



E E 

i— j-max{0,i-l} 



i^-i.ji-i 



(3) 



1-/3 



e /3 



. e l-^ 



-j-l .i— 1 

Let T (a;, y) = J2j+i>i ' ii.'ji ^ ■ J/ 1 '- This expression has 

been previously found [ 7 ] to be the multivariate formal power 
series about the point (xo,yo) — (0,0) of t(x, y) — t\ (x,y) + 
t 2 {x, y) - h (x, y) ■ £ 2 (x, y) where t\ (x, y) and t 2 (x, y) are 
given by the following implicit multivariate functions: 



x = t l {x,y)-e- t ^ x ^ , y = t 2 (x,y) 



-ti(x,y) 



(4) 



However, the mentioned range of convergence in (7J is insuf- 
ficient for our case. (Note also that in [7] the sums should be 
over i + j > 1 and not over i, j > 0.) 

Since we compute the limit normalized expected maximum 
matching, then the expression for 7^ in Equation (O is 
bounded from below by 0, thus, by Equation (0 the double 
summation is bounded from above by a constant. On the other 
hand, all terms in the summation in Equation Q are positive. 
Then, if we look at the partial-sum series (by defining an 
arbitrary order), we get an increasing series which is bounded. 
Thus, by the monotone convergence theorem the double series 
converges for any values x and y satisfying x — j^g ■ e~~$ 

and y = j ■ e~^ . 

However, the multivariate functions in Equation (01 have 
multiple branches (as the Lambert-VK function does [35]), that 
is, for a given x and y there is more than one solution. We 
aim to find this branch in terms of t\ and t<i. We use the 
implicit function theorem to find the derivatives singularities. 
The Jacobian is given by 

e -Mx,y) -ti(x,y)-e- ta( - x < v ^ 

-t 2 (x,y)-e^ tl ^' v '> e -ti(*,v) 

and it is invertible wherever \J\ ^ 0. Thus, there is a derivative 
singularity in case t\ (x, y) ■ t 2 (x, y) = 1, which is the 



J 
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only solution. Therefore, as the given formal power series 
in Equation (01 is about the point (xo,yo) = (0,0) (which 
corresponds toa = 0), where t% = t% = 0, it converges to the 
branch where t\ (x, y) ■ t 2 (x, y) < 1 (note that both t\ (x, y) 
and ti (x, y) are always positive). ■ 



1-/3' 



/. Proof of Corollary [74] 

One of the solutions to Equation (O is given by: t\ 
£2 = %■ By substituting t\ and t 2 in the expression for 7^ 
from Theorem [12] we get that the limit normalized expected 
maximum matching size is 1. We also have to verify that t\ ■ 
£2 < 1- Since j-2l_ and ^ are both positive, we are left with 
y3g • § < 1. By solving the quadratic inequality, we get the 
claimed condition. Note that for a = 1/2 the range reduces 
to $ = 1/2. ■ 



J. Proof of Lemma [6] 

Assume on the contrary that H is connected but that there 
is (at least) a single vertex Vl € Lfj with degree 1. Consider 
the bipartite graph H = (L H + Rh,E h \ that is given by 
removing the vertex vl (and its connected edge) from H . 
By the construction of H, we get that H is connected, but 
Lh + 1 < Rh , which contradicts Lemma Q] ■ 



K, Proof of Theorem \6\ 

As in the proof of Theorem [T] our proof is based on 
counting the expected number of vertices in L that are not 
in some specific maximum matching M of G, based on the 
decomposition of G into its connected components. The proof 
is almost identical, with the modification that, due to Lemma|6] 
we only take into account the d 2 vertices that have a degree 
of 2 (instead of all n vertices in the proof of Theorem [TJ. 

Thus, the expected number of connected components in G 
with s elements in L and s + 1 in R is given by: 



1 



1 



2(d 2 -s)+di 



1 



2.s 



P, 



where the above expression consists of the same considerations 
as in the proof of Theorem Q] Finally, as before, adding the 
expressions for all possible s's and subtracting the sum from 
m yields the claimed result. ■ 



L. Proof of Theorem 

The number of vertices in L with degree 2 follows a 
Binomial distribution with n experiments and a probability of 
success p. In Theorem [6] we found the expected maximum 
matching size of each such instance. Thus, by the law of 
total expectation, the claimed result is given by computing 
the weighted average, where we compute a by the equations 
d\ + d 2 = n and d\ + 2 • d% = a • n. ■ 



M. Proof of Theorem \8\ 

We compute the limit of n as n — >• 00. We consider 
the case where a = = and a = dl+2 ' d2 > 1 are fixed. So 
7 a = lim„ 



M(Ga) fh t : 

in->oo — ~ — > mat is, 



7 a = lim - [ m — V^ 



d.2 \ I rn 



s \s + l 



s + l 



2(d 2 -s)+d 1 



S+l 



P, 



- d\ + d%, we find that 
n. Similarly to the proof 



Given that a = dl+2 ' d2 and n 

n 

di = (a — 1) • n and di = (2 — o) 

of Theorem |U we first have to find that each term in the 
summation is an increasing function with respect to n. We 
discover that (l - £+i) 2 (^)+^ = / _ s+iyn-s ^ 
increasing function (using differentiation), and also find that 
I . (' a- s 1 '' ra ) ( ™ ) • ( S ^ L ) is an increasing function as previ- 
ously. Consequentially, each term in the sum is an increasing 
function and, by the monotone convergence theorem |36], we 
can put the limit inside the sum. By further simplifying the 
above expression as in the proof of Theorem @] we eventually 
get: 



7a = 



2q 2 -(o-l) 

Let T(x) = J2T=i 



E 



(-J) 



i-2 






1! 



(-Q-2-(a-l)-e- aa ) J 



• x 3 be a Taylor expansion, 
where by substituting x = —a • 2 • (a— 1) • e~ aa we get 
the above expression. Similarly to the proof of Theorem |U 
we get that 

T (x) = -W (x) - X -W 2 (x) , 

with convergence within \x\ < e~ l ll35ll . 

Since the function / (a) = —a • 2 • (a — 1) • e~ aa gets 



its minimum at a = a 1 , where it equals 



2(a-l) 

. / 



_1 , and 



2(q-i) r -i 

a e 

substitute x - 



< e l for all a £ [1,2], then for all a we can 



-a- 2- (o-l) 



Hence, it is within the 



radius of convergence of T (x). 

Finally, for the case where a = 1, then d 2 = and d\ = n. 
Therefore, the expression for the expected maximum matching 
size is reduced to m — (m ■ (l — -) ). Thus, 



la 



= lim 



A*(Ga) 



n— >oo 

1 1 



= lim — • 

n— >oo fl 



1 
1 

m 



N. Proof of Corollary® 

We show that -f a is strictly monotonically increasing, thus 
7 a < 1 for 1 < a < 2, since j a = 1 for a = 2. This is shown 
by differentiating 7 a with respect to a: 

dja 1 

da ~ 4a 2 (a - l) 2 

W(-2a(a-l)-e- aa ) 

1 



(W (-2a (a - 1) ■ e~ aa ) + 2a (a - 1)) 



Both the first factor 



Aa 2 (a-iy 



and the third factor 



W (— 2a (a — 1) ■ e aa ) are negative. Thus, if the second 
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factor is positive then -^ is an increasing function with 
respect to a £ [1,2). 

If a > 0.5, then 2a (a — 1) > 1, and since W (x) is 
minimized for x = — - where it equals — 1, the second 
factor is positive. On the other hand, consider that a < 
0.5. Since W (-2a (a - 1) • e" 2 "^" 1 )) = -2 (a - 1) a and 
W (x) is an increasing function, then we have to show 
that -2a (a- 1) • e -2«(«-i) < -2a (a- 1) • e~ aa , that is, 
—2a (a — 1) > -aa. The last inequality can easily be shown 
for 1< a < 2. M 



O. Proof of Theorem [70| 

We compute the limit of ^— E 

/i(G p ) 



as n-> oo. 



7p 



= lim 



n— too Tl 

d 2 =0 V ^ 



'» 



i— d-2 



lim 

n— >oo 



d2— [_ n P~ n ~^ J "t"l 



Let X ~ Bin (n,p) be the random variable counting the 
number of vertices in L that choose 2 vertices in R. By 
summing over three disjoint ranges of possible values for d%, 
we get 

3 

[np — ro^ J 

lv = lim V Pr{X = d 2 }-±-/x(G „ 2 ) + 

3 

I 77,T> + n 4 J — 1 

E ^{X = d 2 }-^^G^ 1+ ^) + 

3 71 

n,p — n - ^ J +1 

n .. 

By Chebyshev's inequality we get that 

Pr < \X — np\ > n* Jnp (1 — p) I < -V. Since 

P (1 - P) < 1> we get that Pr J IX - np\ > n§\ < -\. By 

the fact that ^ • M ( G Q=1+ ^ ) < 1, we find that the first and 

the third limits go to zero. 

Since the function /i (G a ) is increasing with respect to a 
(this can be shown by a simple combinatorial argument), we 
get the following lower bound: 



lim 

n— >oo 



II 



[np+n^ J —1 

lv = lim Y. p r{* = d 2 }---M(G a _ H 

n— >oo * — ' 77. \ °— ■■'■ ' 

3 
d,2 = [rip — n 4 J +1 

> lim [i_ J_ ] . ~.u\ G 

n->oo y U i J n y a=1+ lntzili±l 

as well as the following upper bound: 

3 

[np+n 4 J —1 

lp = lim V Pr{X = d 2 }--. M (G a _ 

3 



< lim 1 n I G 

n— K30 n 



jnp + n4 J-l 



f! Proof of Theorem [75] 

We first establish a few lemmas before proving the result. 
As before, we start by considering a deterministic bipartite 
graph H = (Ljj + Rh, Eh) with degree d of each vertex in 
Lh, where \Ljj\ = s and \Rh\ = <?■ 

Lemma 8: If (d — 1) ■ s < q — 2, then H is not connected. 
Proof: As in the proof of Lemma Q] the proof follows 
by induction on s. For s = 1, there are d edges in the graph 
and therefore every graph with q > d + 1 is not connected. 
Assuming that the claim holds up until s = a', we next prove 
that it holds for any bipartite graph H' such that \Ljj'\ = 
s' + 1 and \R H i \ > (d — 1) • (s' + 1) + 2. Assume towards a 
contradiction that there is a graph 7J' which is connected. 

We first show that there are d— 1 vertices v ri ,v r2 , . . . , v rd _ 1 
in Rh', all of a degree 1 such that they are connected to the 
same vertex vt £ Rh 1 '- The sum of right-side vertex degree 
is d- (s' + 1). Also, since the graph is connected there are no 
right-side vertices with degree 0. This implies that there are 
at least (d— 2) ■ (s' + 1) + 2 vertices of degree 1, thus there 
exists a vertex vi £ Rh> as claimed. 

By the induction hypothesis, the graph induced by Ljj' \ 
{vi} and Rh> \ {v ri , v r2 , . . . , v rdl } is not connected, which 
implies that it has at least two connected components. In H' , vg. 
is connected to all vertices v ri ,v r2 , . . . , v rdl . Since its degree 
is d it can be connected only to one of these components. This 
implies that H' is not connected as well, and the claim follows. 

■ 

Lemma 9: If H is connected and (d — 1) • s = q — 1 then 
H (H) = a. 

Proof: Assume towards a contradiction that [i (H) < s, 
and consider some maximum matching M. Let vi £ Lh 
be a vertex that is not in the maximum matching M, 



and v r 



be the vertices in R (which are not 



necessarily distinct) that are connected to i}£. All vertices 
V ri , V r2 , . . . , v rd _ 1 are connected also to another vertex in Lh, 
otherwise vg. was in the maximum matching M, 

Consider the bipartite graph H = / Lh + Rh, Eh ), which 
is given by removing vg from H. Since the right-side vertices 
Vr-, , Vr*, v r , , are also connected to the other left-side 



vertices (except vg), the 
However, we get that 



L 



H 



bipartite graph H is connected. 



Rh 



= (d-l)-s + l, 



By the squeeze theorem, we get the claimed result. 



= s — 1 and 
which contradicts with Lemma [8] 

We note that in contrast to Lemma [2] the corresponding 
proposition is not true for d > 2; that is, if H is connected 
and s < q, then the maximum matching size is not necessarily 
s. As a counter example, consider the case where d = 3 and 
s = q = 3, where two left-side vertices choose the same 
single right-side vertex (using all their 3 choices), and the 
other left-side vertex chooses all 3 right-side vertices. The 
resulting bipartite graph is clearly connected, but the maximum 
matching size is only 2 (only one of the first two left-vertices 
can be in the matching). 

Lemma 10: If (d — 1) • s = q — 1 then H is connected if 
and only if it is a tree. 

Proof: The proof consists of the exact same construction 
H as in the proof of Lemma @] where we eventually get a 
contradiction with Lemma [8] ■ 
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Fig. 8. the Lambert-VF function 
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Appendix B 
The Lambert- W Function 

The Lambert-W function, usually denoted by W (•), is 
given by the following implicit representation: 

z = W(z)-e w{ - z \ 

where z is a complex number ll35l . 

For real valued arguments, i.e. z is real valued, W (z) has 
two real-valued branches: the principal branch, denoted by 
Wo (•) and the branch W-\ (•). Figure [H] shows the two real- 
valued branches. For instance, Wq (— e _1 ) = W_i (— e _1 ) = 
-1 and Wo (0) = 0. 

Note that the notation W (•) usually relates to the principle 
branch, i.e. Wq (•). Thus, although one would expect that for 
real-valued z,W (z ■ e z ) = z, this is only the case for z > — 1; 
in case z < — 1, W-\ (z ■ e z ) = z ^ W (z ■ e z ). 



Lemma 11: The number Tf of connected bipartite graphs 
H whose \L H \ = s and \R H \ = 2 (d - 1) • s + 1 is T s d = 

Proof: By Lemma [10] we have to count the number of 
bipartite trees over the two disjoint sets Lh and Rh of size s 

and (d — 1) 1-1, Since 7J is a tree, then there are no cycles. 

Consequently, each one of the vertices in L# is connected to 
d distinct vertices in Rh- Moreover, no two vertices in Lh 
share more than 1 vertex in Rh- For each vertex ty e Lh, let 
S v be the set of the d right-side vertices that V£ is connected 
to and also let the cycle C Vl be a cycle that consists of the d 
vertices of S v . 



Consider the graph H = (Rh,Eh), which is given by 
connecting each cycle C Vt to C Vt using a common vertex 
v r if and only if v r is connected to both ve 1 and ve 2 . The 
resulting graph H is a Husimi graph over (d — 1) • S + 
1 vertices, where the number of such (labeled) graphs is 

%z%t l » ((d-i)-s + iy- 2 m. 

Finally, each set S v is determined by the (labeled) vertex 
in Rl- Thus, we multiply by s! the above expression. ■ 

We are now able to prove the result. 

Let M be a maximum matching of G. Similarly to the proof 
of Theorem Q] the proof is based on counting the expected 
number of vertices in R that are not part of M, and on the 
decomposition of G into its connected components. 

We count the expected number of connected components 
with s left-side vertices and q = (d— 1) • s + 1 right-side 
vertices. By Lemma [9] the maximum matching size of each 
such connected component is exactly s. Thus, there are q — s 
right-side vertices that are not in M. 

Let H be a bipartite graph H = (Lh + Rh,Eh), with 
degree d for all vertices in Lh, where \Ljj\ — s and \Rh\ = q- 
The probability P s that H is connected is given by P s = 

(d!) s T a d 
— 7&~i — ■ 



The remainder of the proof is similar to the proof of 
Theorem [T] ■ 



